Claim Amendments

Claim 1. (currently amended) A computerized storage and retrieval system of biological information 
comprising: 

a means for entering data [[entry]];

a means for displaying the data; 

a programmable central processing unit for performing automated analysis using semi-linear pathway
representation (SLIPR) and methods to compare and predict pathways and their components and to establish
a pathways database; [[and]]

a data storage means for storing data; and

a relational database containing annotated and curated protein pathways and annotated information on
the pathways and their components stored in a relational database, wherein the pathways are annotated and
organized in a curated clustering arrangement and the annotated information is accessed through the
relational database.

Claim 2. (currently amended) The system of claim 1, wherein the information pertaining to the pathways
and their components and [[is]] stored in a plurality of tables further comprises [[comprising]] proteins, their
sequences and attributes; protein interactions; protein-protein associations; protein pathways; mRNA,
microarray, and protein expression data; genes, their sequences and attributes; [[and]] descriptions of cells,
tissues, and organs[[,]]; pathology reports[[,]]; and patient histories[[,]] and treatments.

Claim 3. (currently amended) The system of claim 1, wherein the central processing unit is programmed to
run an algorithm or a method for constrained clustering or an optimization algorithm for dynamic
programming and one or more functions or standard methods selected from retrieve, input, edit, annotate,
search, calculate similarities, align, compare, optimize using linear next-neighbor criteria or global
minimization criteria, and predict homologous or orthologous protein pathways and their components and
wherein the method for constrained clustering is further selected from average linkage, single linkage,
complete linkage, K-means, or self-organizing maps.

Claim 4. (currently amended) The system of claim 1, wherein the central processing unit is further
programmed to perform one or more analyses selected from protein sequence analysis, protein interactions
analysis, protein-protein association analysis, protein pathway analysis, gene expression analysis, pathway
annotation analysis, pathway edit analysis, pathway expression analysis, tissue expression analysis,
subtractive hybridization analysis, electronic northern analysis, and [[or]] commonality analysis.

Claim 5. (currently amended) The system of claim 1, wherein each [[the data is entered using the standard for]]
pathway in the database further comprises at least two nodes connected by a mode [[representation]].

Claim 6. (currently amended) The system of claim 5 [[1]], wherein [[a means for displaying the data is used to
show two related pathways as a diagram containing nodes which represent proteins or non-protein molecules;
modes that represent protein interactions or protein-protein associations;]] at least two pathways are compared
using scores calculated from sequence identity, motif or structural homologies that interrelate nodes [[;]] and
employing coefficients of similarity calculated or selected from gene expression including clone counting of
cDNAs, ESTs, promoter regions, or regulatory sequences; electronic northern analysis, immunoprecipitation,
microarray data, quantitative real-time polymerase chain reaction, transcript images, and yeast two hybrid
data; a promoter similarity matrix, a protein-protein association matrix and a similarities [illegible] protein
interactions matrix (SCIM matrix) that interrelate modes of the pathways.

Claim 7. (currently amended) The system of claim 1, wherein the central processing unit is programmed to
[[compare]] perform pairwise comparison of at least two protein pathways or their components selected from
[[by a]] node-only, [[a]] mode-only, and [[or a]] node-and-mode comparison and wherein the node-only
comparison is further selected from protein only, non-protein only, and protein and non-protein nodes.

Claim 8. (currently amended) The system of claim 3 [[1]], wherein the central processing unit [[is
programmed to run]] runs the [[an]] optimization algorithm for dynamic programming further comprising:

a) initializing an array, in which a two dimensional array $M = (M_{i,j})$ with $I$ rows of nodes and variant
length for each row, the length for i-th row is $n_i$, is set up and $M_{I,i} = 0$, where $1 \le i \le n_I$;

b) backfilling the array via backward recursion with the formula 

A/ A =ra^ax|w(a a ,tf /T ) + ^ forl<i<w., 

where $e(\cdot)$ is the step function defined as $e(v) = \{0, \text{ if } v \le 0;\ 1, \text{ if } v > 0\}$ and
$w(\cdot,\cdot)$ is the scoring function between the two nodes, defined as

$0$, if $i = j$, $a_{ik} = a_{ji}$, $a_{ik} = -D$, or $a_{ji} = -D$, where $D > 0$; or

$e[c_{ik,ji} - t_c] \, [\alpha (1 - |s_{ik} - s_{ji}|) + (1 - \alpha) c_{ik,ji}]$ otherwise; and

c) [[using traceback to identify]] finding the best alignment to predict nodes of putative pathways $PPW_j$,
$1 \le j \le \max n_i$, with the top n best scores.
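
As an illustrative aside, the scoring function w(.,.) recited in step b) can be sketched in Python. This is a minimal sketch under stated assumptions, not the claimed implementation: it reads the "otherwise" branch as e(c - t_c) multiplied by [alpha (1 - |s_a - s_b|) + (1 - alpha) c], and the names `step`, `node_score`, the argument layout, and the default values of `alpha`, `t_c`, and `D` are hypothetical.

```python
def step(v: float) -> int:
    """Step function e(v): 0 when v <= 0, 1 when v > 0."""
    return 1 if v > 0 else 0


def node_score(row_a, a, s_a, row_b, b, s_b, c, alpha=0.5, t_c=0.0, D=1.0):
    """Score two nodes (hypothetical encoding of the claim's w(.,.)).

    row_a, row_b : row (pathway) indices of the two nodes
    a, b         : node entries; the value -D marks a gap
    s_a, s_b     : sequence-similarity terms for the two nodes
    c            : coefficient of similarity for the pair
    alpha        : weighting factor between 0 and 1 (assumed default)
    t_c          : threshold applied to c through the step function (assumed)
    """
    # Zero branch: same row, identical entries, or either entry is a gap.
    if row_a == row_b or a == b or a == -D or b == -D:
        return 0.0
    # Non-zero branch: the step function gates a weighted combination.
    return step(c - t_c) * (alpha * (1.0 - abs(s_a - s_b)) + (1.0 - alpha) * c)
```

With `alpha=0.5` and `t_c=0.5`, a pair with similarity terms 0.9 and 0.7 and coefficient 0.8 scores about 0.8, while the same pair with coefficient 0.3 scores 0 because the step function gates it out.
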
Claims 9-22. (canceled) 

Claim 23. (currently amended) The system of claim 3 [[1]], wherein the central processing unit [[is
programmed to run a]] runs the constrained clustering algorithm further comprising:

a) assigning a distance between a protein in a query pathway and each protein [[every pair of proteins]] in
the database;

b) merging at each round the [[two]] protein in the query pathway and the protein in the database with
the closest distance [[pairs]] into a cluster until user-set threshold is met; and

c) computing mean linkage between two clusters using



where[[in]] $n_1$ is the size of $c_1$ and $n_2$ is the size of $c_2$ and the distance between the two clusters is a weighted
average of the distance between two proteins i and j and a function of similarity to query protein
($s_i = s_{iq(i)}$, $s_j = s_{jq(j)}$) where q(i) is the query protein homologous to database protein i and protein-protein association
fop and 


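For illustration only, the mean linkage of step c) can be sketched in Python. The formula itself is not legible in this listing, so the sketch assumes the standard average-linkage definition (the sum of pairwise distances divided by n_1 times n_2). The `weighted_distance` helper, its `weight` parameter, and the use of 1 minus the mean similarity as the "function of similarity" are hypothetical choices, not taken from the claim.

```python
def mean_linkage(cluster1, cluster2, dist):
    """Average pairwise distance between two clusters.

    cluster1, cluster2 : non-empty sequences of protein identifiers
    dist               : callable dist(i, j) returning a distance
    Assumes the standard average-linkage form: sum of d(i, j) / (n1 * n2).
    """
    n1, n2 = len(cluster1), len(cluster2)
    if n1 == 0 or n2 == 0:
        raise ValueError("both clusters must be non-empty")
    total = sum(dist(i, j) for i in cluster1 for j in cluster2)
    return total / (n1 * n2)


def weighted_distance(d_ij, s_i, s_j, weight=0.5):
    """Hypothetical weighted average of a protein-protein distance d_ij and a
    function of the similarities s_i, s_j of each protein to its query protein.
    The function used here (1 minus the mean similarity) is an assumption."""
    return weight * d_ij + (1.0 - weight) * (1.0 - (s_i + s_j) / 2.0)
```

For example, with clusters `[0, 1]` and `[2]` and `dist = lambda i, j: abs(i - j)`, the mean linkage is (2 + 1) / (2 x 1) = 1.5.
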

Claim 24. (canceled) 

Claim 25. (currently amended) The system of claim 7, wherein [[pathway analysis uses]] a node-and-mode
comparison predicts new homologous or orthologous pathways [[by comprising:]]

[[a) submitting a query pathway and protein sequences;]]

b) comparing nodes using an optimization algorithm for dynamic programming or a constrained
clustering algorithm or standard methods [[and wherein a sequence identity score or p value summarizes
similarity and a weighting factor between 0 and 1 is assigned to corresponding nodes;]]

[[c) comparing modes by generating a SCIM matrix, wherein a containing coefficients of similarity is
assigned to corresponding modes;]]

[[d) aligning pathways globally or locally, wherein insertion or deletion of nodes or modes incurs a
penalty;]]

[[e) summing all similarity scores; and]]

[[f) displaying at least one high scoring segment of the aligned pathways derived from node and mode
comparison, thereby performing pathway analysis]].

Claim 26. (currently amended) A process for [[The system of claim 1, wherein the central processing unit is
programmed to perform]] analyzing or predicting novel homologous or orthologous pathways [[analysis using a
submitted query pathway and protein sequences]] comprising:

a) submitting a query pathway and protein sequences;

b) organizing the pathway and sequences using SLIPR [[the standard for pathway representation]];




c) [[b)]] comparing each sequence [[protein sequences]] of the query pathway with each sequence [[all
protein sequences]] in the pathways database using an algorithm for constrained clustering or an optimization
algorithm for dynamic programming and standard methods of protein comparison to calculate sequence
identity scores;

d) [[c)]] using a SCIM matrix derived from protein interactions or protein-protein association analysis
to compare coefficients of similarity for each interaction of the query pathway and each interaction [[all
interactions]] for proteins in a pathway in the pathways database;

e) [[d)]] calculating an overall-similarity score (OS-score) based on sequence identity scores and
coefficients of similarity for each pathway; and

[[e) removing all pathways not meeting a user-specified threshold for OS-score; and]]

f) retrieving aligned pathways meeting a user-specified [[the]] threshold for OS-score, thereby
[[performing]] analyzing or predicting novel homologous or orthologous pathways [[analysis]].
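
By way of illustration, steps e) and f) can be sketched in Python. The claim states only that the OS-score is "based on" sequence identity scores and coefficients of similarity, so the combination used here (a weighted average of the two means, with a hypothetical weight `alpha`) is an assumption, as are the names `os_score` and `retrieve` and the dictionary layout of the input.

```python
def os_score(identity_scores, similarity_coeffs, alpha=0.5):
    """Hypothetical overall-similarity score for one database pathway.

    Assumes a weighted average of the mean sequence identity score and the
    mean coefficient of similarity; the claim does not specify the formula.
    """
    if not identity_scores or not similarity_coeffs:
        raise ValueError("both score lists must be non-empty")
    mean_identity = sum(identity_scores) / len(identity_scores)
    mean_coeff = sum(similarity_coeffs) / len(similarity_coeffs)
    return alpha * mean_identity + (1.0 - alpha) * mean_coeff


def retrieve(pathways, threshold, alpha=0.5):
    """Return (name, OS-score) pairs meeting the user-specified threshold,
    best score first. `pathways` maps a pathway name to a pair
    (identity_scores, similarity_coeffs)."""
    scored = [
        (name, os_score(ids, coeffs, alpha))
        for name, (ids, coeffs) in pathways.items()
    ]
    kept = [(name, score) for name, score in scored if score >= threshold]
    return sorted(kept, key=lambda item: item[1], reverse=True)
```

For instance, `retrieve({"A": ([1.0, 0.5], [0.5]), "B": ([0.25], [0.25])}, threshold=0.5)` keeps only pathway "A", whose score under these assumptions is 0.625.
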

Claim 27. (currently amended) The system of claim 4 [[3]], wherein [[pathways database is searched for]]

[[b)]] performing protein interactions analysis between a [[the]] query pathway and each pathway [[all]]
[[pathways]] in the pathways database produces a [[wherein]] coefficient of similarity [[is produced]] to interrelate
each mode of the query pathway and a mode of the most closely related protein pathway [[;]]

[[c) retrieving at least one pathway alignment; and]]

[[d) showing the alignment based on the search for protein interactions]].

Claim 28. (canceled) 

Claim 29. (currently amended) The process of claim 26, wherein [[The system of claim 3, wherein a query
pathway to search a pathways database to predict]] the predicted orthologous [[pathways comprising:]]

[[a) submitting a query pathway and known protein sequences;]]

[[b) comparing known sequences to all protein sequences stored in the database;]]

[[c) retrieving]] pathway further comprises orthologous proteins with the highest sequence identity to the
known proteins in the query pathway [[;]] and

[[d) inheriting]] protein interactions inherited from the query pathway [[; and]]

[[e) aligning the query pathway and the orthologous proteins, thereby predicting orthologous pathways]].
Claims 30-36. (canceled) 